An EM Approach to Learning Sequential Behavior
نویسندگان
چکیده
We consider problems of sequence processing and we propose a solution based on a discrete state model. We introduce a recurrent architecture having a modular structure that allocates subnetworks to discrete states. Di erent subnetworks are model the dynamics (state transition) and the output of the model, conditional on the previous state and an external input. The model has a statistical interpretation and can be trained by the EM or GEM algorithms, considering state trajectories as missing data. This allows to decouple temporal credit assignment and actual parameters estimation. The model presents similarities to hidden Markov models, but allows to map input sequences to output sequences, using the same processing style of recurrent networks. For this reason we call it Input/Output HMM (IOHMM). Another remarkable di erence is that IOHMMs are trained using a supervised learning paradigm (while potentially taking advantage of the EM algorithm), whereas standard HMMs are trained by an unsupervised EM algorithm (or a supervised criterion with gradient ascent). We also study the problem of learning long-term dependencies with Markovian systems, making comparisons to recurrent networks trained by gradient descent. The analysis reported in this paper shows that Markovian models generally su er from a problem of di usion of temporal credit for long-term dependencies and fully connected transition graphs. However, while recurrent networks exhibit a con ict between long-term information storing and trainability, these two requirements are either both satis ed or both not satis ed in Markovian models. Finally, we demonstrate that EM supervised learning is well suited for solving grammatical inference problems. Experimental results are presented for the seven Tomita grammars, showing that these adaptive models can attain excellent generalization. 1
منابع مشابه
Comparing Bandwidth and Self-control Modeling on Learning a Sequential Timing Task
Modeling is a process which the observer sees another person's behavior and adapts his/her behavior with that which is the result of interaction. The aim of present study was to investigate and compare effectiveness of bandwidth modeling and self-control modeling on performance and learning of a sequential timing task. So two groups of bandwidth and self-control were compared. The task was pres...
متن کاملDetection of children's activities in smart home based on deep learning approach
Monitoring behavior of children in the home is the extremely important to avoid the possible injuries. Therefore, an automated monitoring system for monitoring behavior of children by researchers has been considered. The first step for designing and executing an automated monitoring system on children's behavior in closed spaces is possible with recognize their activity by the sensors in the e...
متن کاملAN APPROACH TO THE A RING OF VITAMIN D ANALOGUES VIA SEQUENTIAL CARBOMETALATION/ ANION CAPTURE
An intramolecular palladium catalysed carbometalation followed by anion capture achieves construction of a model comprised of the A ring of Vitamin D oxygen analogues.
متن کاملSequential EM learning for subspace analysis
Subspace analysis is one of popular multivariate data analysis methods, which has been widely used in pattern recognition. Typically data space belongs to very high dimension, but only a few principal components need to be extracted. In this paper, we present a fast sequential algorithm which behaves like expectation maximization (EM), for subspace analysis or tracking. In addition we also pres...
متن کاملInvestigating the Causes of Divorce through Narrative Analysis in Yazd City and Designing a Prerequisite Education based on the Causes of Divorce using a Hidden Learning Approach on the basis of Family, School, and Student
Introduction: Today, divorce is a well-known and dangerous social phenomenon that disintegrates families and corrupts the society. Therefore, this study aimed to investigate the causes of divorce through narrative analysis in Yazd City and to design a prerequisite education based on the causes of divorce using a hidden learning approach on the basis of family, school, and student approach. Met...
متن کاملInvestigating Predictors of High School Students’ Negative Attitudes Towards Learning English by Developing, Validating, and Running a Questionnaire
The purpose of this study was to explore the predictors of negative attitudes towards learning English from L2 learners’ points of view. A mixed methods research approach was adopted with a sequential exploratory design, followed by an endorsement phase. Eighteen high school students in Fars province (Iran) were interviewed on the sources of negative attitudes towards learning English. Based on...
متن کامل